Information Retrieval Baselines for the ResPubliQA Task

نویسندگان

  • Joaquín Pérez-Iglesias
  • Guillermo Garrido
  • Álvaro Rodrigo
  • Lourdes Araujo
  • Anselmo Peñas
چکیده

This paper describes the baselines proposed for the ResPubliQA task. These baselines are purely based on information retrieval techniques. The selection of an adequate retrieval model that fit the specific characteristic of the supplied data is considered as a core part of the task. Applying a not adequate retrieval function would return a subset of paragraphs where the answer could not appear, and thus the posterior techniques applied in order to detect the answer within the subset of candidates paragraphs will fail. In order to check the ability to retrieve the right paragraph by a pure information retrieval approach, two baselines are proposed. Both of them use the Okapi-BM25[2] ranking function, with and without a stemming pre-process respectively. The main aim was to prove how well can a pure information retrieval system perform on this task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

JU_CSE_TE: System Description QA@CLEF 2010 - ResPubliQA

Abstr act. The article presents the experiments carried out as part of the participation in the Paragraph Selection (PS) Task and Answer Selection (AS) Task of QA@CLEF 2010 – ResPubliQA. Our System use Apache Lucene for document retrieval system. All test documents are indexed using Apache Lucene. Stop words are removed from each question and query words are identified to retrieve the most rele...

متن کامل

Document Expansion for Cross-Lingual Passage Retrieval

This article describes the participation of the joint Elhuyar-IXA group in the ResPubliQA exercise at QA&CLEF 2010. In particular, we participated in the English–English monolingual task and in the Basque– English cross-lingual one. Our focus was threefold: (1) to check to what extent information retrieval (IR) can achieve good results in passage retrieval without question analysis and answer v...

متن کامل

Temporal Information Needs in ResPubliQA: an Attempt to Improve Accuracy. The UC3M Participation at CLEF 2010

The UC3M team participates in 2010 in the second ResPubliQA evaluation campaign taking part in the monolingual Spanish task. On this occasion we have completely redesigned our Question Answering system, product of multiple efforts while being part of the MIRACLE team, by creating a whole new architecture. The aim was to gain in modularity, flexibility and evaluation capabilities that previous v...

متن کامل

Overview of ResPubliQA 2009: Question Answering Evaluation over European Legislation

This paper describes the first round of ResPubliQA, a Question Answering (QA) evaluation task over European legislation, proposed at the Cross Language Evaluation Forum (CLEF) 2009. The exercise consists of extracting a relevant paragraph of text that satisfies completely the information need expressed by a natural language question. The general goals of this exercise are (i) to study if the cu...

متن کامل

The LogAnswer Project at ResPubliQA 2010

The LogAnswer project investigates the potential of deep linguistic processing and logical reasoning for question answering. The paragraph selection task of ResPubliQA 2010 offered the opportunity to validate improvements of the LogAnswer QA system that reflect our experience from ResPubliQA 2009. Another objective was to demonstrate the benefit of QA technologies over a pure IR approach. Two r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009